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The main goal of a route planning approach is to find a trajectory that safely 
transports the robot from one site to the next. Furthermore, it should provide 
an energy-efficient path so the computer can calculate it rapidly. This study 
develops a path-planning system for robots to approach the ball without 
collision. The Bayesian optimization algorithm (BOA) is used to identify the 
shortest path between the robot and the ball. BOA employs a probabilistic 
model to seek the optimum of an uncertain objective function efficiently. 
The performance of the BOA-based path planning system is compared to 
other optimization algorithms such as genetic algorithm, ant colony 
optimization, and firefly algorithm. BOA’s acquisition functions such as 
expected improvement, probability of improvement (PI), and upper 
confidence bound, are investigated. The exact locations of the robots and the 
ball are fed into optimization problems to discover the optimum path. The 


Path planning results reveal that the BOA system outperforms other systems in terms of 
Robotics computational time for planning the optimum path in dynamic situations and 
BOA-PI is the fastest algorithm. 
This is an open access article under the CC BY-SA license. 
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1. INTRODUCTION 

The topic of path planning or motion planning is one of the most basic in robotics, and it has been 
the focus of study for the last two decades. It is essential to map a path between two places or points for a 
robot if such a thing exists. The path should steer clear of any obstacles in the surrounding area and is often 
optimized in some manner, for instance, concerning the amount of time it takes or the amount of energy it 
requires [1]. This issue may be seen in many different contexts, such as a vehicular car-like robot and a 
translational robot. The difficulty of designing a route or path free from collisions becomes much more 
complex as the degree of freedom increases. After the robot has chosen a path that is appropriate for travel, 
for example, by giving power to the wheels, the robot will proceed along that path. When the robot follows 
the course that was programmed for it, it may conclude that the path that was mapped initially out is no 
longer viable because an obstruction is now in its way. This might be due to a need for more information 
about the surrounding area or to the obstruction being moveable and having migrated into a new location 
after the path was established. Some obstacle avoidance is required to stop the robot when a collision occurs. 
The obstacle avoidance, motion, and task planning processes may be carried out randomly. A network of 
interconnected components comprising a hybrid robotic planning system makes it possible to immerse a 
robot in a constantly changing environment. If the sub-tasks are planned, the visualization of the complexity 
of a scenario is a simple task [2]. In this work, an efficient path-planning strategy is developedusing a 
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probabilistic modeling approach. Recently, the need for capable path-planning systems has been increasing. 
The manual operations by people or animals are supplemented by robots that improve productivity and 
safety. For instance, in industrial assembly factories, robots are employed to move components into positions, 
do welding, and carry out several other tasks. They are controlled by programs to accomplish a purpose. The 
proposed path-planning strategy helps robots reach the ball without collision. 

A path planning technique is discussed in [3] under obstacles moving along arbitrary trajectories. 
The non-linear velocity obstacle considers the moving obstacle's velocity, its form, and the curvature of its 
path. It makes it possible to choose a single maneuver for obstacle avoidance (assuming such a thing even 
exists), preventing any number of obstacles from moving along any specified trajectory. The nonlinear 
v-obstacle may be formed as a time integral of the collision velocities, or its borders can be computed using 
analytic formulas. A novel strategy based on artificial potential fields (APF) is discussed in [4]. It offers a 
real-time methodology particularly useful for practical motion planners in uncertain dynamic situations. 
Compared to other methods, the use of Maxwell's equations to develop artificial magnetoquasistatic fields 
(AMF) as an extension of APF results in a more intelligent, natural, and predictable behavior than the 
alternatives. In addition to avoiding stationary obstructions, one of the primary goals of the AMF is to 
circumvent those in motion. The concept of adaptive dimensionality to accelerate the path planning process 
in dynamic situations is applied in [5]. It makes no assumptions about the dynamic model that describes its 
behavior. To be more specific, it takes time dimension only where there is a possibility that a collision will 
take place, and it plans in a state-space that has a low number of dimensions everywhere else. An approach 
for motion planning in dynamic settings is described in [6]. Motion planning refers to determining a path for 
a robot to follow within a scenario that contains both stationary and dynamic moving objects. A realistic 
approach based on a roadmap for the static section of the scene is developed. This road plan computes an 
estimated time-optimal course from a starting configuration to a destination configuration, ensuring the robot 
does not run into any moving obstacles along the way. 

The use of inverse kinematics to create a dexterous hand and a probabilistic road map planner for a robot 
is explored in [7]. The robot is designed to locate the most efficient path for harvesting ripe mushrooms from a 
field of plants planted at random. In addition, the applications of the biologically motivated meta-heuristic 
algorithms firefly algorithm (FA) and ant colony optimization (ACO) [8] have been researched and evaluated. An 
in-depth analysis and critical examination of the important contributions to path planning in dynamic contexts is 
conducted in [9]. It is the safest approach for velocity-based motion planning and has the capability for an 
autonomous robotic system to avoid collision with obstacles in the environment. The probabilistic robot, the 
probabilistic collision state (PCS), the partially closed-loop receding horizon control (PCLRHC), and the gross 
hidden Markov model are the four main approaches that are discussed in [10] for robot motion planning in 
dynamic environments. Markov model, the expectation and maximization method, and the Kalman filter are 
discussed and compared. A method for planning robot maneuvers in dynamic, crowded, and unpredictable 
surroundings is described in [11], [12]. Following the dynamic programming (DP) formulation, a partially 
closed-loop receding horizon control algorithm is implemented. This algorithm's approximation to the DP solution 
integrates prediction, estimation, and planning while also considering chance constraints arising from the robot's 
uncertain location and other moving agents. 

A comprehensive procedure for robot path planning is described in [13]. It uses random sample- 
based methods that can be effectively used to solve practical complex path-planning problems. It also 
considers complexity, completeness, effectiveness, and optimality. The potential field approach is often used 
in autonomously planning the movement of mobile robots [14]. A potential field approach is suggested as a 
solution for autonomous mobile robot path planning in a dynamic environment. The robot's location, 
velocity, acceleration, and physical size are all considered by the improved. A decoupling motion planning 
approach for two manipulators of a live operating robot is described in [15]. It is based on a hybrid 
implementation of the rapidly-exploring random trees (RRT) algorithm. The target offset RRT method, the 
bidirectional RRT algorithm, and the dynamic expansion RRT algorithm are all combined into one hybrid 
RRT algorithm. An optimization technique based on A* and the dynamic window approach is described 
in [16]. First, during global path planning, redundant turning points are removed to cut down on the total 
number of turning points. Second, by developing the motion model of the obstacle, the range of the dynamic 
obstacle is updated to the static environment, and the DWA algorithm's capacity to avoid dynamic obstacles 
is increased by updating the motion model. 

The framework suggested in [17] is capable of planning and optimizing collision-free robot 
trajectories by considering safety distance, path length, and execution time. When a new optimum path has 
been identified, the framework will direct the robot to modify its movement pathways accordingly. The most 
generally used path planning approaches for mobile robot navigation in static and dynamic situations are 
discussed in [18]. These methodologies have been utilized to navigate mobile robots in various environments. 
Global and local path planning methods and traditional and heuristic-based methodologies are also 
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investigated. An effective approach to the problem of creating optimum pathways in dynamic settings is 
described in [19]. Traditional route planning algorithms struggle to adapt to dynamic environmental changes, 
such as shifting obstacles. It uses a genetic algorithm (GA), which is a search heuristic influenced by the 
process of natural selection, to solve this problem. The challenges of dynamic environments, where obstacles 
and environmental conditions can change over time are discussed in [20]. It incorporates a mechanism to 
prevent ants from revisiting recently explored paths. A detailed description of the FA and its application to 
path planning is described in [21]. It adapts the FA algorithm to path planning by defining the fitness 
function and incorporating obstacle avoidance constraints. 

The paper's organization is as follows: an overview of the various strategies currently employed in 
path planning is provided in section 1. Section 2 describes the Bayesian optimization algorithm (BOA)-based 
path planning system. Section 3 discusses the performance of the proposed path planning strategy with other 
existing works. Section 4 concludes this work by summarizing the work carried out. 


2. METHODS AND MATERIALS 

This section discusses the proposed path planning strategy in more detail. Computers are incredibly 
competent in a wide variety of application areas, and as a result, the world in which we live has been 
completely revolutionized. Implementing these technologies has helped improve the quality of work that has 
historically been performed by people and has entirely automated some jobs, eliminating the need for humans 
to do such tasks. But, computer-controlled automated systems, such as in sports, are not yet at the point 
where they can completely replace humans. One of the areas in which computers still need to catch up to 
their human counterparts is their inability to design exact trajectories for the motions that a robot has to carry 
out to accomplish a certain job. It is not being claimed that computers are not capable of path planning; 
rather, the methods that are currently in use do not make it possible for computers to demonstrate the same 
level of speed in path planning, ability to generalize and accuracy of human skills. 

Evolutionary algorithms are a class of optimization algorithms that are inspired by the process of 
natural selection and evolution. They are often used to solve problems that are difficult or impossible to solve 
using traditional optimization techniques. The basic idea behind evolutionary algorithms is to create a 
population of potential solutions to a problem and then use a process of selection, crossover, and mutation to 
evolve that population towards better solutions over multiple generations. This process is designed to mimic 
the process of natural selection, in which individuals with better-adapted traits are more likely to survive and 
pass on their genes to the next generation. Many evolutionary algorithms exist, including genetic algorithms 
(GA) [22], FA, ACO, and DP. These algorithms can solve a wide range of optimization problems, including 
problems in engineering, finance, logistics, and more. Overall, evolutionary algorithms have proven to be a 
powerful and flexible tool for optimization and are widely used in many fields of research and industry. 
Figure 1 shows the proposed system architecture. 


Input Video Locating the ball Path planning algorithm 
(Robots playing (ROBOSOCCER model (Bayesian optimization 
football) and the Kalman filter) Algorithm) 


Send a signal to the robot 
to reach the ball 


Figure 1. Proposed system architecture 


Though the proposed system in this paper finds the shortest path to help the robot to reach the ball, 
the overall flow of the system shown in Figure 1 includes the object detection approach to find the ball in the 
image. To locate the ball in the robosoccer environment, the algorithm in [23] is employed before planning 
the best path without collision. It is a three-class classification problem using a convolution neural network 
[24] and Kalman filter as tracking algorithms in a robosoccer simulation environment. With classifying an 
autonomous, programmable, medium-sized humanoid robot (NAO), the ball, and the goalpost, the tracking of 
the NAO and the ball is also achieved from the first to the final framewithout manual intervention. 

One of the main advantages of BOA is that it can find the global optimum with a relatively small 
number of objective function evaluations. This is achieved by balancing exploration (generating new 
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candidate solutions) and exploitation (using the model to generate solutions expected to perform well). 
Another advantage is that the probabilistic model provides a measure of uncertainty, which can guide the 
search toward promising regions of the search space. Bayesian optimization has been successfully applied to 
various optimization problems in various fields, including engineering, finance, and machine learning [25]. 
The BOA to find the best path is discussed in the following sub-section. 


2.1. BOA 

Bayesian optimization is a popular optimization technique that uses a probabilistic model to search 
for the optimum of an unknown objective function efficiently. It is particularly useful when the objective 
function is expensive to evaluate or when the search space is high-dimensional. The basic idea behind BOA 
is to construct a probabilistic model, such as a Gaussian process, that represents the unknown objective 
function. The model is updated iteratively as new objective function evaluations become available. The 
model is then used to generate new candidate solutions, which are evaluated to update the model again. The 
algorithm continues this process until a satisfactory solution is found. The Bayesian optimization algorithm 
involves the following steps: 

a. Initialize a probabilistic objective function model using an appropriate surrogate model, such as a 
Gaussian process or a tree-based model. This study uses Gaussian process as an objective function for 
path panning. 

b. Choose an acquisition function that determines which point in the search space should be evaluated next. 
Examples of acquisition functions include expected improvement (EI), probability of improvement (PI), 
and upper confidence bound (UCB). 

c. Using the acquisition function and the probabilistic model, identify the next point in the search space to 
evaluate. 

d. Evaluate the objective function at the chosen point. 

Update the probabilistic model with the new observation. 

f. Repeat steps (a-e) until a stopping criterion is met, such as a maximum number of function evaluations or 
a satisfactory solution. 

g. Return the best solution found during the search 


D 


2.1.1. Objective function 
Let us consider the minimization problem in (1): 


findp *€ argminf (p) (1) 
pEX 


where the objective function is f: Xp C R? > R. The ambiguity set X is referred to as the optimization 
problem's search space with X; = X i.e., without the loss of generality and it is a box-constrained set. The 
observations (y) is corrupted by noise, which is formulated as (2): 


y=f(p)+a (2) 


where a~N(0,0;7)and o, represents standard deviation. The BOA’s first building block is the response 
surface. It includes a surrogate model to learn the objective function. Given an input point x, a surrogate 
model should deliver the predicted value of the genuine objective function and the associated uncertainty in 
the forecast. 

A response surface gives a probabilistic representation of an unknown objective function. Let us 
consider the posterior probabilities p(f |X, y) for N number of inputs (X) and observations (y) obtained by 
updating the prior probability p(f) by the likelihood p(f|X, f). Then the posterior probability is defined as (3): 


PIX y) x pfx, Pe) (3) 


The likelihood p(f |X, f) in (3) updates p(f) based on the collected data points. In this case, the data 
points represent the positions of NAOs and the ball's location. The initial assumptions about thp(f) is are 
very important to reduce the complexity of the BOA. 


2.1.2. Acquisition function 

The second main component of BAO is the acquisition function. The values of the response surface 
are fed to the acquisition function to search for the best fit for the problem, i.e., find the best path for the 
NAO to reach the ball without collision. 
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Throughout the process of finding the optimal balance, acquisition functions must make trade-offs 
between exploration and exploitation. During the exploration phase, objective assessments are focused on 
areas of the response surface with a high degree of uncertainty, while the exploitation phase prioritizes the 
selection of input sites at which predictions achieve small values (in the case of a minimization problem). 
The acquisition function acts as a map that directs the search for the optimal solution and contributes to the 
data efficiency of the optimization process that is applied to the optimization function f. 

a. Expected improvement (EI) 

EI is a commonly used acquisition function in Bayesian optimization, an iterative algorithm for the 
global optimization of black-box functions [26]. The basic idea behind EI is to find the next point to evaluate 
that is expected to have the greatest potential for improvement over the current best-observed value. In other 
words, EI evaluates the expected improvement of a potential new point in the search space over the current 
best value. To calculate EI, the algorithm first fits a probabilistic model to the observations of the objective 
function. This model is typically a Gaussian process, which captures the uncertainty of the function values 
and the correlations between points in the search space. 

Given the model, the EI for a candidate point is computed as the expected improvement over the 
current best value. This expected improvement is defined as the difference between the candidate point's 
value and the current best value, conditional on the observations and the model. The EI function is designed 
to balance exploration and exploitation, encouraging exploration of the search space to find promising 
regions while exploiting the information gained from previous evaluations to refine the search. By iteratively 
applying the EI function, Bayesian optimization can converge to the global optimum of the objective function 
with relatively few evaluations. 

b. Probability of improvement (PI) 

The PI function is designed to find the next point to evaluate in the search space with the highest 
probability of improving upon the current best value [27]. The PI function is defined as the probability that a 
candidate point will improve the current best value, given the observations of the objective function and the 
probabilistic model of the function. Specifically, the PI function is defined as: 


PI(x) = Pf (x) > f) (4) 


where x is a candidate point, f(x) is the value of the objective function at x, x* is the current best-observed 
point, and P(f (x) > f (x*)) is the probability that the value of the objective function at x is greater than the 
current best value. The PI function can be computed using the cumulative distribution function (CDF) of the 
Gaussian process model, which provides a probabilistic estimate of the likelihood that the objective function 
value at x is greater than the current best value. 

The PI function encourages exploration of the search space by assigning high values to points with a 
high probability of improvement while also exploiting the information gained from previous evaluations to 
refine the search. Bayesian optimization can converge to the global optimum of the objective function with a 
very small number of evaluations if the PI function is applied repeatedly throughout the process. 

c. Upper confidence bound (UCB) 

The UCB function [28] is designed to balance the exploration and exploitation trade-off by selecting 
the next point to evaluate in the search space based on a trade-off between the estimated mean and the 
uncertainty of the objective function value. The UCB function is defined as: 


UCB (x) = u(x) + ka(x) (5) 


where u(x) is the estimated mean of the objective function value at x, o(x) is the estimated uncertainty of the 
objective function value at x, and k is a constant that controls the balance between exploration and exploitation. 

The UCB function encourages search space exploration by assigning high values to points with high 
uncertainty and high expected improvement while exploiting the information gained from previous 
evaluations to refine the search. The constant k determines the balance between exploration and exploitation, 
with larger values of k promoting exploration and smaller values of k promoting exploitation. The UCB 
function is widely used in multi-armed bandit problems and other decision-making problems where the goal 
is to balance the trade-off between exploration and exploitation. In Bayesian optimization, the UCB function 
can be combined with a probabilistic model of the objective function to search for the global optimum with 
relatively few evaluations efficiently. 


3. RESULTS AND DISCUSSIONS 
The performances of the proposed BOA system are discussed in this section. The BOA system finds 
a time or energy-efficient trajectory for robots in Robosoccer environment. Thus, the time complexity of the 
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BOA system is analyzed using different acquisition functions in BOA and compared to other optimization 
algorithms. Each randomly selected NAObegins in a different configuration within the environment. The 
trajectory of a NAO is generated by selecting random objectives and then using the BOA to build the 
pathways to reach the ball. The start and target configurations for each NAO are chosen such that the 
resultant path would be sufficiently lengthy, guaranteeing that they would traverse a significant portion of the 
surrounding environment. If NAOs of different sizes are used, then the moving NAO will pass through any 
little gaps that the robot attempts to pass through, which will cause congestion at those small gaps. To avoid 
congestion, all NAO is considered to be the same size. Table 1 shows the time complexity to reach the ball 
without collision for five NAO using different acquisition functions in BOA. It is noted that the distance 
between the NAO and the ball is fixed for computing the time complexity. If another NAO blocks the path, 
the BOA finds an alternate path to reach the ball without collision. The time shown for each NAO is the 
average time from the results of ten runs. 


Table 1. The time complexity of the BOA approach for different acquisition functions 
Time complexity in milliseconds (ms) 


TAO BOA-EI __BOA-PI __BOA-UCB 
1 6.8 5.3 7.9 
2 7.2 5.45 8.5 
3 6.5 4.7 8 
4 8.1 6 9.8 
5 10.2 7.9 12 
Overall average 7.76 5.87 9.24 


It can be seen from Table 1 that the BOA-PI system computes the path to reach the ball very quickly 
than other combinations such as BOA-EI and BOA-CUB. The BOA-EI approach requires ~1.89 ms more 
than BOA-PI to compute the best path, whereas BOA-UCB requires ~3.3 ms more than BOA-PI. To 
compare the performance of the system, different optimization algorithms such as GA [26], [27], 
ACO [28], [29], and FA [30] with the same setting is analyzed. Figure 2 shows the performances of different 
optimization algorithms in terms of time complexity to find the best path. 

It can be seen from Figure 2 that the BOA approach with different acquisition functions outperforms 
other optimization algorithms, and among the acquisition function, PI is the fastest algorithm to find the path 
without collision. The time complexity of the proposed system is ~7 ms lesser than GA [29], [30] and ~5ms, 
~4ms lesser than ACO [31], [32] and FA [33], respectively. 


E Time (ms) 
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Figure 2. Performance comparison of BOA with other algorithms 


4. CONCLUSION 

The present study in this paper proposes an effective approach for path planning in robosoccer 
environment. It develops a rule that takes in the positions of NAOs and ball location as inputs and generates a 
comprehensive sequence of movements that can proficiently steer the robot along an optimal trajectory from 
its initial position to the ball location. The system employs BOA to determine the optimal path based on 
probabilistic modeling. The Gaussian process serves as the objective function, while an analysis is conducted 
on three distinct acquisition functions, such as EI, PI, and UCB, to determine the optimal trajectory. The 


Bulletin of Electr Eng & Inf, Vol. 13, No. 1, February 2024: 465-472 


Bulletin of Electr Eng & Inf ISSN:2302-9285 m) 471 


comparative analysis of the BOA-based system's performance is conducted in relation to other evolutionary 
algorithms, including GA, ACO, and FA. The findings demonstrate that BOA-PI offers the optimal route 
with minimal computational complexity of ~5 milliseconds. 
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